Abstract: We address single-user data transmission over a
channel where the received signal incurs interference from a finite 
number of users (interfering users) that use single codebooks 
for transmitting their own messages. The receiver, however, is 
allowed to decode interfering users' messages. This means the 
signal transmitted from any interfering user is either decoded or 
considered as noise at the receiver side. We propose the following 
method to obtain an achievable rate for this channel. Assuming 
its own data is decoded successfully, the receiver partitions the 
set of interfering users into two disjoint subsets, namely the set 
of decodable users and the set of non-decodable users. Then the 
transmitter's rate is chosen such that the intended signal can 
be jointly decoded with the set of decodable users. To show 
the strength of this method, we prove that for the additive 
Gaussian channel with Gaussian interfering users, the Gaussian 
distribution is optimal and the achievable rate is the capacity of 
this channel. To obtain the maximum achievable rate, one needs 
to find the maximum decodable subset of interfering users. Due to 
the large number of possible choices, having efficient algorithms 
that find the set of decodable users with maximum cardinality is 
desired. To this end, we propose an algorithm that enables the 
receiver to accomplish this task in polynomial time. 

Index Terms: Interference channel, capacity region, submodular
functions, combinatorial optimization.

I. Introduction 

Treating interference as noise is not always the best
strategy. Since the interference caused by another user
is intrinsically a codeword from a codebook, it is possible
to decode the interference at the receiver side, which permits
transmission at higher rates. If, instead, the interference is
treated as noise, its structure is ignored,
resulting in lower data rates. Moreover, there are multi-user
channels whose capacity regions are achieved when the
interference is decoded by some or by all receivers. Examples
of such channels are the degraded Broadcast Channel (BC)
and the strong two-user Gaussian Interference Channel (IC), cf.
[1]-[3].

Decoding the interference caused by interfering users has been
addressed in several papers. As an example, the best known inner
bound for the two-user IC is the Han-Kobayashi achievable
rate region [4], which is obtained by combining multilevel
coding and jointly typical decoding. Transmitters split their
data into two parts, and independent codebooks are used
to transmit each part. Receivers jointly decode their own data
and some part of the data of the unintended user.

Focusing on a particular transmit-receive pair in a multi-user 
system, we obtain a single-user channel where the received 
signal incurs interference from a number of interfering users. 
We assume that interfering users use single codebooks, which
are generated independently of each other. Having information
about the rates and codebooks of interfering users, the receiver 
is allowed to decode interfering messages. This in turn means 
that the signal transmitted from any interfering user is either 
decoded or considered as noise. 

We propose the following method to obtain an achievable 
rate for the channel. Assuming its own data is decoded 
successfully, the receiver finds the maximum decodable subset 
of interfering users. By a maximum decodable subset, we
mean a set of users that is decodable at the receiver when the
rest are regarded as noise, and that contains every other
decodable set. It is shown that this task can be accomplished by using
a polynomial time algorithm. Once the receiver obtains the 
maximum decodable subset, it can partition the interfering 
users into two disjoint subsets, namely decodable users and 
non-decodable users. Then, the transmitter's rate is chosen 
such that the intended signal can be jointly decoded with the 
set of decodable users. We also propose a polynomial time 
algorithm to find the maximum achievable rate obtainable by 
this method. 

To show the strength of this method, we prove that for the 
additive Gaussian channel with Gaussian interfering users, the 
Gaussian distribution is optimal and the achievable rate is the 
capacity of this channel. 

As an application, we use this model to characterize some 
achievable rate points for the M-user Gaussian IC. In general, 
we are interested in characterizing achievable rate region for 
the M-user IC where each transmitter is allowed to transmit 
data by using a single codebook and each receiver is allowed 
to decode any subset of interfering users. Therefore, at each 
receiver, the signal transmitted from any interfering user is 
either decoded or considered as noise. 

Unlike the two-user case, the general M-user IC has received less
attention in the literature. State-of-the-art work on deriving
achievable rate vectors treats interfering users as noise [5]-[13].
For example, in [6] the M-user Gaussian IC is studied
where transmitters are allowed to allocate different powers in 
different bandwidths and receivers treat interference as noise. 
Recently, in [14], [15], successive interference cancelation is 
studied. For example, in [14] the optimal order of decoding 
that maximizes the minimum rate among all users is obtained. 

The organization of this paper is as follows. In Section II, we 
introduce the system model and some background materials. 
In Section III, we consider a discrete memoryless channel 
consisting of M transmitters and one receiver. We assume that 
the users' rate vector is not necessarily inside the capacity 
region of the Multiple Access Channel (MAC) seen at the 
receiver side, which results in failure of the receiver to reliably
decode all the data streams. The receiver's task, however, 
is to find a maximum decodable subset of transmitters so 
that their data can be decoded from the received signal. 
We propose a polynomial-time algorithm which finds the 
maximum decodable subset of users. 

In Section IV, we consider single-user data transmission 
over a channel with $M-1$ interfering users. We first obtain
a lower bound and an upper bound on the capacity of this 
channel. Then, we propose a method that characterizes an 
achievable rate for the channel. This achievable rate is a 
function of other users' rates. We then prove that this function 
is piecewise linear. 

In Section V, we consider additive channels where the 
interference caused by other users is Gaussian. We prove that 
for this case, the Gaussian codebook achieves the capacity 
where each interfering user is either decoded or treated as 
noise by the receiver. 

In Section VI, we investigate applications of the proposed 
algorithms to the M-user Gaussian IC. We first develop a
polynomial-time algorithm that characterizes points obtainable
from successive maximization of users' rates. We then
generalize the notion of one-sided Gaussian ICs to the M-user
case, and characterize a point on the boundary of the
capacity region. We finally obtain the capacity of the strong 
one-sided M-user Gaussian IC. In Section VII, we conclude 
the paper. 

Notations: Throughout this paper, we use the following notations.
Vectors are represented by boldface letters. Random
variables and sets are denoted by capital letters, where the
difference is clear from the context. The difference, union,
and intersection of two sets $U$ and $V$ are represented by
$U \setminus V$, $U \cup V$, and $U \cap V$, respectively. The complement
of a subset $U$ is denoted by $\bar{U}$. The cardinality of a set
$U$ is denoted by $|U|$. $M$ is always the number of users in
the system. We use $E$ to denote the set $\{1, 2, \ldots, M\}$. $2^E$
denotes the power set of $E$, which is the collection of all
subsets of $E$. For any set $S \subseteq E$ and any vector
$\mathbf{x} = [x_1, x_2, \ldots, x_M]$, we use the compact notations $\mathbf{x}(S)$ and $\mathbf{x}_S$
to denote $\sum_{i \in S} x_i$ and $[x_i]_{i \in S}$, respectively. In particular,
$\mathbf{x}_{-i} = \mathbf{x}_{\overline{\{i\}}} = [x_1, \ldots, x_{i-1}, x_{i+1}, \ldots, x_M]$. $\mathbb{R}$ is the set of
real numbers and $\mathbb{R}^k$ denotes the $k$-dimensional Euclidean space.

II. Preliminaries 

A. System Model 

We consider single-user data transmission over a channel
with $M-1$ interfering users. The channel is specified by the transition
probability function $\omega(y_1 \mid x_1, x_2, \ldots, x_M)$, where $x_i \in \mathcal{X}_i$ is
the input letter to the channel from the $i$th user and $y_1 \in \mathcal{Y}_1$
is the output letter received by the receiver; see Figure 1. The
set of users' indices is denoted by $E$. $x_1$ is the input letter
from the intended user and $x_i$, for $i = 2, 3, \ldots, M$, are input
letters from interfering users. We assume that the interfering
users transmit data at the rate vector $\mathbf{R}_{-1} = [R_2, R_3, \ldots, R_M]$
by using single codebooks generated randomly from the joint
probability distribution $p_{X_2}(x_2)\, p_{X_3}(x_3) \cdots p_{X_M}(x_M)$. We are
interested in characterizing the capacity of this channel.

Fig. 1. Single user in an interfering medium. $x_1$ is the input letter
from the intended user and $x_i$, $i = 2, 3, \ldots, M$, is the input
symbol corresponding to the $i$th interfering user.



We also consider the continuous Gaussian case modeled by

$$y_1 = x_1 + x_2 + \cdots + x_M + z, \tag{1}$$

where $x_1$ and $y_1$ denote transmitted and received symbols,
respectively. $x_i$, $i = 2, 3, \ldots, M$, is the input symbol
corresponding to the $i$th interfering user, which uses a single Gaussian
codebook with power $P_i$ and rate $R_i$. $z$ is the additive white
Gaussian noise with variance $N$. The transmitter is subject to
the average power constraint $P_1$ and tries to send data at the maximum
rate $R_1$.

B. Submodular Functions 

Definition 1: Let $E$ be a finite nonempty set. A function
$f : 2^E \to \mathbb{R}$ is called a submodular function if it satisfies

$$f(V \cup U) + f(V \cap U) \le f(V) + f(U), \tag{2}$$

for any $V, U \subseteq E$. A function $f$ is called supermodular if
$-f$ is submodular. A modular function is a function which is
both submodular and supermodular.
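For small ground sets, inequality (2) can be checked directly by enumerating all pairs of subsets. The following is a minimal sketch of such a check; the helper names `all_subsets` and `is_submodular`, the tolerance `tol`, and the two test functions are illustrative assumptions and are not part of the paper. The enumeration is exponential in $|E|$ and is only meant to make the definition concrete.

```python
from itertools import combinations


def all_subsets(ground):
    """Return every subset of `ground` as a frozenset."""
    items = sorted(ground)
    return [frozenset(c)
            for k in range(len(items) + 1)
            for c in combinations(items, k)]


def is_submodular(f, ground, tol=1e-12):
    """Brute-force test of f(U | V) + f(U & V) <= f(U) + f(V) for all U, V."""
    subs = all_subsets(ground)
    return all(f(U | V) + f(U & V) <= f(U) + f(V) + tol
               for U in subs for V in subs)


def concave_of_size(U):
    """A concave function of the cardinality, hence submodular."""
    return len(U) ** 0.5


def convex_of_size(U):
    """A convex function of the cardinality, hence supermodular."""
    return len(U) ** 2


ground = {1, 2, 3}
print(is_submodular(concave_of_size, ground))               # True
print(is_submodular(convex_of_size, ground))                # False
print(is_submodular(lambda U: -convex_of_size(U), ground))  # True
```

The last line illustrates the definition of supermodularity: negating a supermodular function yields a submodular one.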

Submodular functions are one of the most important objects
in discrete optimization. In fact, they play the same role in
discrete optimization as convex functions do in the continuous
case [16]. Besides having a polynomial-time algorithm
based on the ellipsoid method [17], there are combinatorial
algorithms for minimizing submodular functions in strongly
polynomial time, cf. [16] and [18].

If a submodular function is nondecreasing, i.e., $f(U) \le f(V)$
if $U \subseteq V$, and $f(\emptyset) = 0$, then the associated polyhedron

$$B(f) = \{\mathbf{x} \mid \mathbf{x}(U) \le f(U),\ \forall\, U \subseteq E,\ \mathbf{x} \ge 0\}, \tag{3}$$

is a polymatroid. Likewise, if a supermodular function is
nondecreasing and $f(\emptyset) = 0$, then the associated polyhedron

$$Q(f) = \{\mathbf{x} \mid \mathbf{x}(U) \ge f(U),\ \forall\, U \subseteq E\}, \tag{4}$$

is a contra-polymatroid.



C. Properties of Mutual Information for Independent Random 
Variables 

In this subsection, we review some important equalities and
inequalities in Information Theory. We consider $M$ independent
random variables $X_1, X_2, \ldots, X_M$. Moreover, let
$E = \{1, 2, \ldots, M\}$ denote the set of random variables' indices. For
any random variable $Y$, we have the following properties:

1) Chain Rule: For any disjoint subsets $U$ and $V$, we have
the following equality:

$$I(X_{U \cup V}; Y) = I(X_V; Y \mid X_U) + I(X_U; Y). \tag{5}$$

2) Independent Conditioning Inequality: For any disjoint
subsets $U$ and $V$, the following inequality holds:

$$I(X_U; Y) \le I(X_U; Y \mid X_V). \tag{6}$$

3) Polymatroidal Property: In [19], it is shown that the set
function $\sigma(U) = I(X_U; Y \mid X_{\bar{U}})$ is submodular and
nondecreasing, i.e.,

$$\sigma(U \cup V) + \sigma(U \cap V) \le \sigma(U) + \sigma(V), \quad \forall\, U, V \subseteq E. \tag{7}$$

Hence, its associated polyhedron is a polymatroid.

4) Contra-polymatroidal Property: We claim that the set
function $\rho$ defined as $\rho(U) = I(X_U; Y)$ is a supermodular
function. To this end, fix arbitrary subsets $U$ and $V$. Let
$S = U \cap V$. From the chain rule, we have

$$I(X_{U \cup V}; Y) = I(X_U; Y) + I(X_{V \setminus U}; Y \mid X_U), \tag{8}$$

which, since $V \setminus U = V \setminus S$, can equivalently be written as

$$\rho(U \cup V) = \rho(U) + I(X_{V \setminus S}; Y \mid X_{U \setminus S}, X_S). \tag{9}$$

From the Independent Conditioning Inequality, we have
$I(X_{V \setminus S}; Y \mid X_S) \le I(X_{V \setminus S}; Y \mid X_{U \setminus S}, X_S)$. Hence,

$$\rho(U \cup V) \ge \rho(U) + I(X_{V \setminus S}; Y \mid X_S). \tag{10}$$

Adding $\rho(U \cap V) = \rho(S)$ to both sides, we obtain

$$\rho(U \cup V) + \rho(U \cap V) \ge \rho(U) + I(X_{V \setminus S}; Y \mid X_S) + I(X_S; Y). \tag{11}$$

Since $I(X_{V \setminus S}; Y \mid X_S) + I(X_S; Y) = I(X_V; Y)$, we have

$$\rho(U \cup V) + \rho(U \cap V) \ge \rho(U) + \rho(V), \tag{12}$$

as claimed. It is easy to show that $\rho$ is nondecreasing and
hence its associated polyhedron is a contra-polymatroid.

D. Multiple Access Capacity Region 

One of the most important results in Information Theory 
is the characterization of the capacity region of the MAC 
[20], [21]. The capacity region of a MAC can be represented 
as follows. We define $\mathcal{P}$ as the collection of all probability
distributions which can be written as $P(x_1, x_2, \ldots, x_M, y) =
p(x_1) p(x_2) \cdots p(x_M)\, \omega(y \mid x_1, x_2, \ldots, x_M)$, where
$\omega(y \mid x_1, x_2, \ldots, x_M)$ is the channel transition probability
function. Now, the capacity region of a MAC is

$$C_{\mathrm{MAC}} = \mathrm{conv}\Big( \bigcup_{P \in \mathcal{P}} C_{\mathrm{MAC}}(P) \Big), \tag{13}$$

where $\mathrm{conv}(\cdot)$ denotes the convex hull operation, and $C_{\mathrm{MAC}}(P)$ is
defined as

$$C_{\mathrm{MAC}}(P) = \{\mathbf{R} \mid \mathbf{R}(U) \le I(X_U; Y \mid X_{\bar{U}}),\ \forall\, U \subseteq E\}. \tag{14}$$

Using the polymatroidal property of the mutual information,
it is easy to show that $C_{\mathrm{MAC}}(P)$ is a polymatroid. It is worth
noting that even though $C_{\mathrm{MAC}}$ is the union of polymatroids, it is
not necessarily a polymatroid. However, $C_{\mathrm{MAC}}$ is a polymatroid
for the $M$-user Gaussian MAC modeled by

$$y = x_1 + x_2 + \cdots + x_M + z, \tag{15}$$

where $y$ is the received symbol, $x_i$ is the transmitted symbol of
user $i$, and $z$ is additive white Gaussian noise with zero mean
and variance $N$. User $i$ is also subject to an average power
constraint $P_i$. The capacity region of the $M$-user Gaussian
MAC can be stated as

$$C_{\mathrm{GMAC}} = \Big\{\mathbf{R} \;\Big|\; \mathbf{R}(U) \le \gamma\Big(\frac{\mathbf{P}(U)}{N}\Big),\ \forall\, U \subseteq E\Big\}, \tag{16}$$

where $\gamma(x) = 0.5 \log_2(1 + x)$.

Fig. 2. Transmitter $i$ uses a random codebook for data transmission
at rate $R_i$. The receiver's task is to find the maximum decodable
subset of users.
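Membership in the Gaussian MAC region (16) amounts to checking one inequality per nonempty subset of users. The sketch below is a direct, exponential-time illustration of that test; the function names, the 0-based user indexing, and the tolerance `tol` are assumptions made for this example only.

```python
import math
from itertools import combinations


def gamma(x):
    """gamma(x) = 0.5 * log2(1 + x), as defined after (16)."""
    return 0.5 * math.log2(1.0 + x)


def in_gaussian_mac_region(rates, powers, noise, tol=1e-12):
    """Check R(U) <= gamma(P(U) / N) for every nonempty subset U of users."""
    users = range(len(rates))
    for k in range(1, len(rates) + 1):
        for U in combinations(users, k):
            sum_rate = sum(rates[i] for i in U)
            sum_power = sum(powers[i] for i in U)
            if sum_rate > gamma(sum_power / noise) + tol:
                return False
    return True


# Two users with unit powers and unit noise variance.
print(in_gaussian_mac_region([0.3, 0.3], [1.0, 1.0], 1.0))   # True
print(in_gaussian_mac_region([0.4, 0.45], [1.0, 1.0], 1.0))  # False
```

In the second call each individual rate is below $\gamma(1) = 0.5$, but the sum rate exceeds $\gamma(2)$, so the sum-rate constraint is the one that fails.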

III. Maximum Decodable Subset 

In this section, we consider a discrete memoryless channel
consisting of $M$ transmitters, with input alphabet $\mathcal{X}_i$ for the
$i$th transmitter, and one receiver with output alphabet $\mathcal{Y}$, where
each transmitter uses a single codebook for data transmission.
This channel is specified by the transition probability function
$\omega(y \mid x_1, x_2, \ldots, x_M)$, where $x_i \in \mathcal{X}_i$ is the input letter to the
channel from the $i$th transmitter and $y \in \mathcal{Y}$ is the output
letter received by the receiver; see Figure 2. The random
codebooks used for data transmission at the rate vector
$\mathbf{R} = [R_1, R_2, \ldots, R_M]$ are generated by using the joint probability
distribution $p_{X_1}(x_1)\, p_{X_2}(x_2) \cdots p_{X_M}(x_M)$ for random
variables $X_1, X_2, \ldots, X_M$.

The rate vector R may fall outside of the capacity region of 
the MAC seen at the receiver side which results in failure to 
reliably decode all data streams. The receiver's task, however, 
is to find a decodable subset of transmitters so that their data 
can be decoded from the received signal. To this end, the 
receiver partitions the set of transmitters into two disjoint parts 
and tries to jointly decode the data sent by the transmitters 
within the first partition, while considering the signals of 
transmitters in the second partition as noise. 

In what follows, we compute the complexity of finding a
decodable subset of transmitters by an exhaustive search. Let
$E = \{1, 2, \ldots, M\}$ denote the set of transmitters' indices.
There are $2^M$ ways to partition $E$ into two subsets, and to
verify that a subset $V$ with cardinality $k$ is decodable, $2^k - 1$
inequalities must be checked due to (14). Hence, in general,
the total number of inequalities to be verified is

$$\sum_{k=0}^{M} \binom{M}{k} (2^k - 1) = 3^M - 2^M,$$

which is exponential in the number of users.
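The closed form above follows from the binomial theorem, since $\sum_k \binom{M}{k} 2^k = 3^M$ and $\sum_k \binom{M}{k} = 2^M$. As a quick sanity check, the count can also be evaluated term by term; the function name below is an illustrative choice.

```python
from math import comb


def exhaustive_check_count(M):
    """Number of inequalities checked by exhaustive search over all subsets."""
    return sum(comb(M, k) * (2 ** k - 1) for k in range(M + 1))


for M in (1, 2, 3, 10):
    # Each line prints the summed count next to the closed form 3^M - 2^M.
    print(M, exhaustive_check_count(M), 3 ** M - 2 ** M)
```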

Definition 2 (Maximum decodable subset): A set of transmitters
is a maximal decodable subset if all transmitters in
the subset are jointly decodable by the receiver, and it is not a
proper subset of any other decodable subset. If the maximal
decodable subset is unique, we call it the maximum decodable
subset.

Lemma 1: For any channel, there is a maximum decodable 
subset. 

Proof: Suppose there are two distinct maximal decodable subsets
of transmitters, namely $U$ and $V$. By maximality, neither of them is a
subset of the other, so $U$ and $V$ are proper subsets of their union
$U \cup V$. Moreover, their union is decodable by the receiver, since
every transmitter in it belongs to a decodable subset. This
contradicts the fact that both subsets are maximal. □

We first describe some properties of the maximum decodable
subset. There are two cases of special interest. The first
case occurs when all transmitters are decodable by the receiver,
i.e., the maximum decodable subset is the set $E$. In this case,
the transmitters' rates must satisfy the inequalities given in
(14). In the second case, however, none of the transmitters is
decodable by the receiver, i.e., the maximum decodable subset
is empty. The following lemma shows that for the second case
the rate vector $\mathbf{R}$ must be in a certain contra-polymatroid.

Lemma 2: None of the signals is decodable by the receiver
if and only if the transmitters' rates satisfy

$$\mathbf{R}(U) \ge I(X_U; Y), \quad \forall\, U \subseteq E. \tag{17}$$

Moreover, the region of the rate vectors satisfying the above
inequalities forms a contra-polymatroid.

Proof: We first prove that if a rate vector $\mathbf{R}$ satisfies (17),
then none of the signals is decodable. To this end, we assume
that $V$ is the maximum decodable subset and $V \ne \emptyset$. Since
$V$ is a decodable subset, we have the following constraints on
the rates of the members of $V$:

$$\mathbf{R}(T) \le I(X_T; Y \mid X_{V \setminus T}), \quad \forall\, T \subseteq V. \tag{18}$$

By substituting $T = V$ in the above inequality, we have

$$\mathbf{R}(V) \le I(X_V; Y), \tag{19}$$

which is a contradiction, and this completes the "if" part of
the proof.

Next, we need to prove that if the inequalities in (17)
are not satisfied, there is at least one transmitter which is
decodable. Suppose there are some subsets that do not satisfy
(17). Assume $W$ has the minimum cardinality among all of them and
satisfies

$$\mathbf{R}(W) < I(X_W; Y). \tag{20}$$

If $|W| = 1$, then the transmitter in $W$ is decodable by
considering everything else as noise, which is the desired result.
Hence, we assume $|W| > 1$. If all members of $W$ are jointly
decodable, then we have found a decodable subset. Otherwise,
there must be a subset of $W$, say $V$, satisfying

$$\mathbf{R}(V) > I(X_V; Y \mid X_{W \setminus V}). \tag{21}$$

By decomposing the mutual information in (20), we obtain

$$\mathbf{R}(W) < I(X_{W \setminus V}; Y) + I(X_V; Y \mid X_{W \setminus V}). \tag{22}$$

From the minimality of $|W|$, we have

$$\mathbf{R}(W \setminus V) \ge I(X_{W \setminus V}; Y). \tag{23}$$

By combining the two inequalities (21) and (23) and considering
the fact that $\mathbf{R}(W) = \mathbf{R}(W \setminus V) + \mathbf{R}(V)$, we conclude
that

$$\mathbf{R}(W) > I(X_{W \setminus V}; Y) + I(X_V; Y \mid X_{W \setminus V}) \tag{24}$$
$$\phantom{\mathbf{R}(W)} = I(X_W; Y), \tag{25}$$

which contradicts (20). This completes the "only if" part of
the proof.

It is easy to see that the function on the right-hand side
of (17) is supermodular and monotone; hence the
region formed by rates satisfying (17) is a contra-polymatroid.

□

In the following theorem, the characterization of the maxi- 
mum decodable subset is presented. 

Theorem 1: A subset $S \subseteq E$ is a maximum decodable
subset if and only if the transmitters' rates satisfy the following
inequalities:

$$\mathbf{R}(V) \le I(X_V; Y \mid X_{S \setminus V}), \quad \forall\, V \subseteq S, \tag{26}$$

$$\mathbf{R}(U) \ge I(X_U; Y \mid X_S), \quad \forall\, U \subseteq \bar{S}. \tag{27}$$

Proof: Inequality (26) corresponds to the capacity region
of the MAC for members of $S$, considering members of $\bar{S}$
as noise. Hence, the members of $S$ are decodable iff the
inequalities in (26) are satisfied. The set $S$ is a maximum
decodable subset if no transmitter in $\bar{S}$ is decodable by
the receiver. Now, by applying Lemma 2 and considering that
all members of $S$ are decoded, we conclude that none of the
transmitters in $\bar{S}$ is decodable iff the inequalities in (27) are
satisfied. This completes the proof. □

For a given maximum decodable subset $S \subseteq E$, we define
$D^S$ as

$$D^S = \{\mathbf{R} \mid \mathbf{R}(V) \le I(X_V; Y \mid X_{S \setminus V}),\ \forall\, V \subseteq S,\quad
\mathbf{R}(U) \ge I(X_U; Y \mid X_S),\ \forall\, U \subseteq \bar{S}\}. \tag{28}$$

$D^S$ is a polyhedron because it is the intersection of finitely
many half spaces. By Theorem 1, $D^S$ consists of all rate
vectors with the same maximum decodable subset $S$. Since
for any rate vector there is an associated maximum decodable
subset, we have $\bigcup_{S \subseteq E} D^S = \mathbb{R}_+^M$. This means that $\mathbb{R}_+^M$ is
represented as the union of finitely many polyhedral sets. An
example for the case of the additive two-user Gaussian channel
is given in Figure 3.

The result of this section can be directly extended to
continuous channels. The most applicable class of continuous
channels is the additive Gaussian channel defined by

$$y = x_1 + x_2 + \cdots + x_M + z, \tag{29}$$

where $z$ is additive Gaussian noise with zero mean
and variance $N$. We assume users transmit at rates
$\mathbf{R} = [R_1, \ldots, R_M]$ using Gaussian codebooks with average powers
$\mathbf{P} = [P_1, \ldots, P_M]$. In the following example, we apply the
result of Theorem 1 to a two-user additive Gaussian channel.

Fig. 3. Decision regions used for determining the maximum decodable
subset for a two-user additive Gaussian channel. For any
rate in $D^{\{1,2\}}$, the receiver can decode both signals. For rates
in $D^{\{1\}}$ and $D^{\{2\}}$, the receiver is able to decode transmitters
1 and 2, respectively. Finally, the receiver can decode neither
1 nor 2 for any rate in $D^{\emptyset}$.

Example 1: Consider a two-user additive Gaussian channel
where the received signal can be written as $y = x_1 + x_2 + z$. In
this case, $E$ has four subsets, namely $S_1 = \{1, 2\}$, $S_2 = \{1\}$,
$S_3 = \{2\}$, and $S_4 = \emptyset$. By applying Theorem 1, we obtain the
following conditions for the subsets of $E$ to be the maximum
decodable subset.

1) $S_1$ is the maximum decodable subset. In this case, the
conditions $R_1 \le \gamma\big(\frac{P_1}{N}\big)$, $R_2 \le \gamma\big(\frac{P_2}{N}\big)$, and
$R_1 + R_2 \le \gamma\big(\frac{P_1 + P_2}{N}\big)$ must be satisfied.

2) $S_2$ is the maximum decodable subset. In this case, the
conditions $R_1 \le \gamma\big(\frac{P_1}{P_2 + N}\big)$ and $R_2 \ge \gamma\big(\frac{P_2}{N}\big)$ must be
satisfied.

3) $S_3$ is the maximum decodable subset. In this case, the
conditions $R_2 \le \gamma\big(\frac{P_2}{P_1 + N}\big)$ and $R_1 \ge \gamma\big(\frac{P_1}{N}\big)$ must be
satisfied.

4) $S_4$ is the maximum decodable subset. In this case, the
conditions $R_1 \ge \gamma\big(\frac{P_1}{P_2 + N}\big)$, $R_2 \ge \gamma\big(\frac{P_2}{P_1 + N}\big)$, and
$R_1 + R_2 \ge \gamma\big(\frac{P_1 + P_2}{N}\big)$ must be satisfied.

The set of conditions described above partitions $\mathbb{R}_+^2$ into
four regions, as illustrated in Figure 3. It can be seen from
the figure that $D^{\{1,2\}}$ is a polymatroid corresponding to the
capacity region of a two-user MAC and $D^{\emptyset}$ is a
contra-polymatroid according to Lemma 2.
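The four cases of Example 1 translate into a small decision procedure. The sketch below classifies a two-user rate pair by testing the conditions in the order listed; the function name `classify_two_user` and the convention of resolving boundary ties in favor of the earlier case are assumptions of this illustration.

```python
import math


def gamma(x):
    """gamma(x) = 0.5 * log2(1 + x)."""
    return 0.5 * math.log2(1.0 + x)


def classify_two_user(r1, r2, p1, p2, n):
    """Return the maximum decodable subset for a two-user Gaussian channel."""
    # Case 1: both users are jointly decodable (two-user MAC region).
    if (r1 <= gamma(p1 / n) and r2 <= gamma(p2 / n)
            and r1 + r2 <= gamma((p1 + p2) / n)):
        return {1, 2}
    # Case 2: only user 1 is decodable, with user 2 treated as noise.
    if r1 <= gamma(p1 / (p2 + n)) and r2 >= gamma(p2 / n):
        return {1}
    # Case 3: only user 2 is decodable, with user 1 treated as noise.
    if r2 <= gamma(p2 / (p1 + n)) and r1 >= gamma(p1 / n):
        return {2}
    # Case 4: neither user is decodable.
    return set()


# Unit powers and unit noise variance.
print(classify_two_user(0.3, 0.3, 1.0, 1.0, 1.0))   # {1, 2}
print(classify_two_user(0.2, 0.9, 1.0, 1.0, 1.0))   # {1}
print(classify_two_user(0.4, 0.45, 1.0, 1.0, 1.0))  # set()
```

The last call shows a rate pair whose individual rates are each below $\gamma(P_i/N)$ but which still falls in $D^{\emptyset}$, because the sum rate exceeds the MAC sum capacity and neither user can be decoded while the other is treated as noise.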

The above example shows that finding the maximum decodable
subset is equivalent to finding the region to which the
transmitters' rate vector belongs. Since the number of
regions grows exponentially with the number of transmitters,
a polynomial-time algorithm for solving the problem
is desired. To this end, we first define the function $f : 2^E \to \mathbb{R}$
as follows:

$$f(V) = I(X_V; Y \mid X_{\bar{V}}) - \mathbf{R}(V), \tag{30}$$

where $V \subseteq E$.

Fig. 4. $E$ is the ground set. $S$ is the maximum decodable subset. $W$ is
the minimizer of $f$ in (31).

Lemma 3: The function $f$ defined in (30) is a submodular
function.

Proof: The result directly follows from the modularity
of $\mathbf{R}$ and the submodularity of mutual information. □
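The step behind Lemma 3 can be written out explicitly. The derivation below is a sketch in the notation of (7) and (30); it uses only the submodularity of $\sigma(U) = I(X_U; Y \mid X_{\bar{U}})$ and the identity $\mathbf{R}(U \cup V) + \mathbf{R}(U \cap V) = \mathbf{R}(U) + \mathbf{R}(V)$, which holds because $\mathbf{R}(\cdot)$ is a sum over the elements of its argument.

```latex
\begin{align*}
f(U \cup V) + f(U \cap V)
  &= \sigma(U \cup V) + \sigma(U \cap V)
     - \mathbf{R}(U \cup V) - \mathbf{R}(U \cap V) \\
  &= \sigma(U \cup V) + \sigma(U \cap V)
     - \mathbf{R}(U) - \mathbf{R}(V) \\
  &\le \sigma(U) + \sigma(V) - \mathbf{R}(U) - \mathbf{R}(V) \\
  &= f(U) + f(V).
\end{align*}
```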
Since there are polynomial-time algorithms for minimizing
any submodular function, cf. [16] and [18], the following
optimization problem can be solved efficiently:

$$f(W) = \min_{V \subseteq E} f(V). \tag{31}$$

If the minimum of $f$ in (31) is zero, then all transmitters are
decodable by the receiver due to (14). Otherwise, there is at
least one transmitter of the set $E$ which is not decodable. In
the following theorem, we prove that indeed no member of
the minimizer of $f$ is decodable, and all of them need to be
considered as noise.

Theorem 2: No member of the subset $W$ that minimizes $f$
in (31) is decodable by the receiver, provided that the minimum
in (31) is not zero and the minimizer of minimum cardinality
is used. In fact, all members of $W$ must be considered as noise,
i.e., if $S$ is the maximum decodable subset, then $W \cap S = \emptyset$.

Proof: We first partition the minimizer subset $W$ into
two disjoint sets $U$ and $T$, where $U = W \cap S$ and $T = W \setminus S$;
see Figure 4. We need to show that $U = \emptyset$. Suppose $U$ is
nonempty. Hence $|U| \ge 1$ and $|T| < |W|$. Since $U$ is a subset
of $S$, from (26) we have

$$\mathbf{R}(U) \le I(X_U; Y \mid X_{S \setminus U}). \tag{32}$$

The inclusion $S \setminus U \subseteq \bar{W}$ and the independence of the random
variables imply $I(X_U; Y \mid X_{S \setminus U}) \le I(X_U; Y \mid X_{\bar{W}})$. Hence,

$$\mathbf{R}(U) \le I(X_U; Y \mid X_{\bar{W}}). \tag{33}$$

From the definition of $f$ in (30), we have

$$f(W) = I(X_W; Y \mid X_{\bar{W}}) - \mathbf{R}(W). \tag{34}$$

From the chain rule and the fact that $T$ and $U$ partition $W$
into two disjoint subsets, we have the following equation:

$$I(X_W; Y \mid X_{\bar{W}}) = I(X_T; Y \mid X_{\bar{W}}, X_U) + I(X_U; Y \mid X_{\bar{W}})
= I(X_T; Y \mid X_{\bar{T}}) + I(X_U; Y \mid X_{\bar{W}}). \tag{35}$$

Substituting (35) into (34) and using $\mathbf{R}(W) = \mathbf{R}(T) + \mathbf{R}(U)$,
we obtain

$$f(W) = f(T) + I(X_U; Y \mid X_{\bar{W}}) - \mathbf{R}(U). \tag{36}$$

Using the inequality (33), we conclude that

$$f(T) \le f(W). \tag{37}$$

If $f(T) < f(W)$, then it contradicts the optimality of $W$, and
if $f(T) = f(W)$, then it contradicts the fact that $W$ has
minimum cardinality among all minimizers. This completes
the proof. □

By applying Theorem 2 and using the well-known submodular
function minimization algorithms as a subroutine, cf. [18]
and [16], we propose the following polynomial-time algorithm
for finding the maximum decodable subset.

Algorithm 1 (Finding the maximum decodable subset):

1) Set $S = E$.

2) Find the minimizer $W$ of minimum cardinality such that

$$f(W) = \min_{V \subseteq S} f(V),$$

where $f$ is

$$f(V) = I(X_V; Y \mid X_{S \setminus V}) - \mathbf{R}(V). \tag{38}$$

3) If $W = \emptyset$, STOP. $S$ is the maximum decodable subset.
Otherwise, set $S \leftarrow S \setminus W$.

4) If $S = \emptyset$, STOP. No subset of $E$ is decodable.
Otherwise, GO TO step 2.

Theorem 3: Algorithm 1 converges to the maximum decodable
subset in polynomial time.

Proof: Since in each iteration $W$ is a nonempty set
(otherwise, the algorithm stops), this algorithm converges in at
most $|E|$ iterations. Furthermore, in each iteration, we need
to minimize a submodular function, which can be done in
polynomial time [16]. Hence, the total running time of the
algorithm is polynomial. □
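To make the iteration concrete, the following sketch runs Algorithm 1 on the additive Gaussian channel (29), where $I(X_V; Y \mid X_{S \setminus V})$ reduces to $\gamma\big(\mathbf{P}(V) / (N + \mathbf{P}(E \setminus S))\big)$ when users outside $S$ are treated as Gaussian noise. Note the substitution: the submodular minimization in step 2 is done here by brute-force enumeration, which is exponential in $|S|$ and only suitable for small examples; a polynomial-time submodular minimizer, as in [16] and [18], would replace that inner loop. The function names, 0-based user indices, and the tolerance `tol` are assumptions of this sketch.

```python
import math
from itertools import combinations


def gamma(x):
    """gamma(x) = 0.5 * log2(1 + x)."""
    return 0.5 * math.log2(1.0 + x)


def subsets(items):
    """Yield all subsets of `items` in order of increasing cardinality."""
    items = sorted(items)
    for k in range(len(items) + 1):
        for combo in combinations(items, k):
            yield frozenset(combo)


def max_decodable_subset(powers, rates, noise, tol=1e-12):
    """Algorithm 1 for the Gaussian channel (users are indexed from 0)."""
    users = set(range(len(powers)))
    S = set(users)                                    # step 1
    while S:
        # Users outside S are treated as additional Gaussian noise.
        floor = noise + sum(powers[i] for i in users - S)
        # Step 2 by brute force. Subsets are visited by increasing size and
        # replaced only on strict improvement, so ties keep the smallest set.
        best, best_val = frozenset(), 0.0             # f(empty set) = 0
        for V in subsets(S):
            val = (gamma(sum(powers[i] for i in V) / floor)
                   - sum(rates[i] for i in V))
            if val < best_val - tol:
                best, best_val = V, val
        if not best:                                  # step 3: W is empty
            return S
        S -= best                                     # step 3: S <- S \ W
    return S                                          # step 4: S is empty


print(max_decodable_subset([1.0, 1.0], [0.2, 0.9], 1.0))  # {0}
```

In this run the first pass removes user 1 (index 1), whose rate exceeds what the receiver can support even with user 0 removed; the second pass finds no violated constraint for user 0 alone, so the algorithm stops with $S = \{0\}$, in agreement with case 2 of Example 1.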

IV. An Achievable Rate 

In this section, we propose a method to obtain an achievable
rate for the channel introduced in Section II-A. We also provide a
polynomial-time algorithm to characterize this achievable rate. A lower
bound on the capacity of the channel can be obtained by treating
the interfering users as noise and optimizing over all input
distributions. Hence, we have

$$\max_{p(x_1)} I(X_1; Y_1) \le C, \tag{39}$$

where $C$ denotes the capacity of the channel. Now, we assume that,
regardless of the input distribution, the receiver is able to
decode all interfering users considering its own signal as noise.
By this assumption, an upper bound on the capacity can be
obtained as follows:

$$C \le \max_{p(x_1)} I(X_1; Y_1 \mid X_{E \setminus \{1\}}). \tag{40}$$



Let us assume that the transmitter uses $p_{X_1}(x_1)$ to generate
a single random codebook. We need to find the maximum
achievable rate $R_1$. If $R_1$ is an achievable rate, then the
receiver can successfully decode its intended data. After
decoding its own signal, the receiver can search in the set
$E \setminus \{1\}$ for the maximum decodable subset $S \subseteq E \setminus \{1\}$. This
procedure can be done efficiently using Algorithm 1. Let us
define $V = E \setminus (S \cup \{1\})$. $V$ is the set of users that the receiver
treats as noise. From (26), we have

$$\mathbf{R}(U) \le I(X_U; Y_1 \mid X_{(S \cup \{1\}) \setminus U}), \quad \forall\, U \subseteq S. \tag{41}$$

To find $R_1$, we consider the MAC consisting of user 1 and
the users in $S$, while the users in $V$ are considered as noise.
From (14), the rate vector $\mathbf{R}$ is achievable if

$$\mathbf{R}(U) \le I(X_U; Y_1 \mid X_{(S \cup \{1\}) \setminus U}), \quad \forall\, U \subseteq S \cup \{1\}. \tag{42}$$

Since half of the inequalities in (42) are satisfied by (41) and
the only unknown parameter is $R_1$, we can maximize the user's
rate based on the following optimization problem:

$$R_1(\mathbf{R}_{-1}) = \min_{U \subseteq S}\ I(X_1, X_U; Y_1 \mid X_{S \setminus U}) - \mathbf{R}(U). \tag{43}$$

The optimization problem (43) is again a submodular function
minimization and can be solved by polynomial-time algorithms.

In the following, we summarize the above procedure.

Algorithm 2 (Finding an achievable rate):

1) Given $p(x_1)$, find the maximum decodable subset $S$
among interfering users by using Algorithm 1 and
assuming that the user's data is decoded.

2) Solve the submodular function minimization in (43).

As a by-product of the above algorithm, we can find the
subset of interfering users that can be decoded first at the
receiver and whose effect can be removed.
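For the Gaussian case, both steps of Algorithm 2 can be sketched in a few lines. As an assumption of this illustration, Gaussian codebooks are used so that $I(X_1, X_U; Y_1 \mid X_{S \setminus U})$ becomes $\gamma\big((P_1 + \mathbf{P}(U)) / (N + \mathbf{P}(V))\big)$ with $V$ the set of users treated as noise, and both minimizations are done by brute force rather than by a polynomial-time submodular minimizer. The function names and the dictionary-based interface (interferer index mapped to power or rate) are hypothetical.

```python
import math
from itertools import combinations


def gamma(x):
    """gamma(x) = 0.5 * log2(1 + x)."""
    return 0.5 * math.log2(1.0 + x)


def subsets(items):
    """Yield all subsets of `items` in order of increasing cardinality."""
    items = sorted(items)
    for k in range(len(items) + 1):
        for combo in combinations(items, k):
            yield frozenset(combo)


def decodable_interferers(powers, rates, noise, tol=1e-12):
    """Step 1: Algorithm 1 on the interferers, intended signal removed."""
    S = set(powers)
    while S:
        floor = noise + sum(p for i, p in powers.items() if i not in S)
        best, best_val = frozenset(), 0.0
        for V in subsets(S):
            val = (gamma(sum(powers[i] for i in V) / floor)
                   - sum(rates[i] for i in V))
            if val < best_val - tol:
                best, best_val = V, val
        if not best:
            break
        S -= best
    return S


def achievable_rate(p1, powers, rates, noise):
    """Step 2: evaluate the minimization (43) over all U contained in S."""
    S = decodable_interferers(powers, rates, noise)
    floor = noise + sum(p for i, p in powers.items() if i not in S)
    return min(
        gamma((p1 + sum(powers[i] for i in U)) / floor)
        - sum(rates[i] for i in U)
        for U in subsets(S)
    )


# One interferer (user 2) with unit power; P1 = 1 and N = 1.
for r2 in (0.1, 0.4, 0.9):
    print(r2, achievable_rate(1.0, {2: 1.0}, {2: r2}, 1.0))
```

With these numbers, a low interferer rate ($R_2 = 0.1$) gives the interference-free rate $\gamma(P_1/N)$, an intermediate rate ($R_2 = 0.4$) is limited by the sum-rate constraint $\gamma((P_1 + P_2)/N) - R_2$, and a high rate ($R_2 = 0.9$) forces the receiver to treat user 2 as noise, giving $\gamma(P_1/(P_2 + N))$.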

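Proposition 1: If $U$ minimizes (43), then the receiver is
capable of decoding all users in $W = S \setminus U$ by considering
everything else as noise.

Proof: At the first step, one needs to decode $W$. This
requires

$$\mathbf{R}(T) \le I(X_T; Y_1 \mid X_{W \setminus T}), \quad \forall\, T \subseteq W. \tag{44}$$

Suppose there is a subset $T^*$ that does not satisfy (44), that
is,

$$\mathbf{R}(T^*) > I(X_{T^*}; Y_1 \mid X_{W \setminus T^*}). \tag{45}$$

Hence,

$$\begin{aligned}
R_1' &= I(X_1, X_{U \cup T^*}; Y_1 \mid X_{S \setminus (U \cup T^*)}) - \mathbf{R}(U \cup T^*) \\
&\overset{(a)}{=} I(X_1, X_U; Y_1 \mid X_{S \setminus U}) - \mathbf{R}(U)
 + I(X_{T^*}; Y_1 \mid X_{W \setminus T^*}) - \mathbf{R}(T^*) \\
&\overset{(b)}{=} R_1(\mathbf{R}_{-1}) + I(X_{T^*}; Y_1 \mid X_{W \setminus T^*}) - \mathbf{R}(T^*) \\
&\overset{(c)}{<} R_1(\mathbf{R}_{-1}),
\end{aligned} \tag{46}$$

where $R_1'$ denotes the value of the objective in (43) at $U \cup T^*$,
(a) follows from the chain rule and the fact that
$(S \setminus (U \cup T^*)) \cup T^* = S \setminus U$ and $S \setminus (U \cup T^*) = W \setminus T^*$, (b)
follows from the definition of $R_1(\mathbf{R}_{-1})$ and the minimality of $U$,
and (c) follows from (45). The last inequality contradicts the
fact that $U$ is the solution of the minimization problem in
(43). This completes the proof. □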

In light of Proposition 1, the set $E$ is decomposable into
three disjoint subsets, namely $V$, $U \cup \{1\}$, and $W$. $V$ is the
complement of $S \cup \{1\}$, where $S \cup \{1\}$ is the union of the maximum
decodable subset $S$ and the intended user. Therefore, the
receiver is not able to decode the interfering users in $V$ and
considers them as noise. $W$ is the part of $S$ that the receiver
can decode by considering everything else as noise. $U \cup \{1\}$
is the subset of users that need to be decoded jointly after
removing the effect of $W$.

As indicated in (43), the achievable rate is a function of
the interfering users' rates. In order to derive some properties of
this function, we need the following definition.

Definition 3 (Piecewise linear functions [22]): A function
$f : \mathbb{R}^n \to \mathbb{R}$ is piecewise linear if, first, its domain can
be represented as the union of finitely many polyhedral sets,
and second, $f$ is affine within each polyhedral set, i.e.,
$f(\mathbf{x}) = \mathbf{a}^T \mathbf{x} + b$ for some vector $\mathbf{a}$ and scalar $b$.

In the following theorem, we summarize some properties of
$R_1$ as a function of $\mathbf{R}_{-1}$.

Theorem 4: The function $R_1(\mathbf{R}_{-1})$ defined in (43) is piecewise
linear. More precisely, $R_1(\mathbf{R}_{-1})$ consists of at most
$3^{M-1}$ affine pieces.

Proof: As in (28), let us define the region $D^S$ as

$$D^S = \{\mathbf{R}_{-1} \mid \mathbf{R}(T) \le I(X_T; Y_1 \mid X_{S \setminus T}, X_1),\ \forall\, T \subseteq S,\quad
\mathbf{R}(U) \ge I(X_U; Y_1 \mid X_S, X_1),\ \forall\, U \subseteq V\}, \tag{47}$$

where $V = E \setminus (S \cup \{1\})$. Due to (43), the function $R_1(\mathbf{R}_{-1})$
is defined as the pointwise minimum of $2^{|S|}$ affine functions
over the polyhedral set $D^S$. As a result, $R_1(\mathbf{R}_{-1})$ is piecewise
linear, continuous, and concave over $D^S$, cf. Theorem 2.49
in [22].

Since $R_1(\mathbf{R}_{-1})$ is a piecewise linear function over each $D^S$
and $\bigcup_{S \subseteq E \setminus \{1\}} D^S = \mathbb{R}_+^{M-1}$, it is a piecewise linear function
over $\mathbb{R}_+^{M-1}$. Moreover, each polyhedron $D^S$ is divided into
at most $2^{|S|}$ sub-polyhedra, in each of which $R_1$ is an affine
function. Hence, the total number of components is not more
than

$$\sum_{|S|=0}^{M-1} \binom{M-1}{|S|} 2^{|S|} = 3^{M-1}. \tag{48}$$

This completes the proof. □

It is worth noting that, although $R_1$ is a concave function
over each $D^S$, it is not a concave function over $\mathbb{R}_+^{M-1}$.

Example 2: Consider an additive channel $y_1 = x_1 + x_2 +
x_3 + z_1$ where all users use Gaussian codebooks for data
transmission. In this case, the maximum decodable subset
of interfering users is a subset of $\{2, 3\}$. Hence, there are
four regions $D^{\emptyset}$, $D^{\{2\}}$, $D^{\{3\}}$, and $D^{\{2,3\}}$, where $R_1(R_2, R_3)$
is a concave function over each of them. For instance,
$R_1(R_2, R_3) = \gamma\big(\frac{P_1}{P_2 + P_3 + N_1}\big)$ over $D^{\emptyset}$ and, by (43),
$R_1(R_2, R_3) = \min\big\{\gamma\big(\frac{P_1}{P_2 + N_1}\big),\ \gamma\big(\frac{P_1 + P_3}{P_2 + N_1}\big) - R_3\big\}$
over $D^{\{3\}}$. In Fig. 5, an example of the function $R_1(R_2, R_3)$ for this
channel is illustrated. As depicted in the figure, $R_1(R_2, R_3)$
is a piecewise linear and continuous function. It also consists
of 9 components, i.e., $3^{M-1}$ for $M = 3$.

Fig. 5. The function $R_1(\mathbf{R}_{-1})$ for a channel with two interfering
users.

Example 3: In this example, we consider the binary adder
channel with $M - 1$ interfering users. The channel model can
be written as $y_1 = x_1 \oplus x_2 \oplus \ldots \oplus x_M$. We further assume
that users' codebooks are randomly chosen from Bernoulli
sequences with $p(0) = p(1) = 0.5$. In this case, it is easy
to show that

$$R_1(\mathbf{R}_{-1}) = [1 - R(E \setminus \{1\})]^+, \tag{49}$$

where $[a]^+ = a$ if $a > 0$ and $0$ otherwise. This reflects the
fact that the function $R_1(\mathbf{R}_{-1})$ may have fewer than $3^{M-1}$
components.
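The rate expression in (49) is simple enough to evaluate directly. The following is a minimal sketch, assuming rates are measured in bits per channel use; the function name `binary_adder_rate` is ours.

```python
def binary_adder_rate(interferer_rates):
    """Evaluate [1 - R(E \\ {1})]^+ for the binary adder channel.

    interferer_rates: rates of the M - 1 interfering users (bits/channel use).
    The result is one bit minus the interferers' sum rate, clipped at zero.
    """
    return max(0.0, 1.0 - sum(interferer_rates))

# Two interferers whose sum rate is below one bit leave a positive rate.
print(binary_adder_rate([0.25, 0.5]))
# Once the interferers' sum rate exceeds one bit, the rate is zero.
print(binary_adder_rate([0.7, 0.6]))
```

The function has only two affine pieces regardless of $M$, which is the point made by the example.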

V. Channel's Capacity for the Gaussian Case 

In this section, we prove that, provided Gaussian distributions
are used for codebook generation, the achievable rate obtained
in the previous section is indeed the capacity of the additive
channel with Gaussian noise and $M - 1$ Gaussian interfering
users.

To show that any rate above $C$ (the output of Algorithm
2 where $p(x_1)$ is Gaussian) is not achievable, we construct a
degraded broadcast channel and show that if a rate $R_1 > C$
is achievable, then one can communicate reliably outside the
capacity region of this channel, which is a contradiction. The
following lemma assists us in constructing such a degraded
channel.

Lemma 4: For any set of independent Gaussian codebooks
with power vector $\mathbf{P} = [P_1, P_2, \ldots, P_K]$ and rate vector
$\mathbf{R} = [R_1, R_2, \ldots, R_K]$, there is a $K$-user Gaussian broadcast
channel with the following properties:

1) The transmitter's total power is $P(E)$.

2) There are $L$ noise levels: $N_1 < N_2 < \ldots < N_L$.

3) Users are partitioned into $L$ disjoint subsets, that is, $E = \bigcup_{i=1}^{L} U_i$.
All users in $U_i$ have the same noise level $N_i$,
for $i \in \{1, 2, \ldots, L\}$.

4) The rate vector $\mathbf{R}$ lies on the boundary of the capacity
region. $\mathbf{R}$ is achievable using Gaussian codebooks with
powers in one-to-one correspondence with the components
of $\mathbf{P}$.

Proof: We aim at building a Gaussian broadcast channel
with $x$ as input and $y_1, y_2, \ldots, y_L$ as outputs, where $y_i = x + n_i$
and $n_i$ is additive white Gaussian noise with variance
$N_i$. To this end, assume there is a $K$-user Gaussian MAC
with noise level $N$ and transmit power vector $\mathbf{P}$. Then, $\mathbf{R}$ is
achievable if

$$R(T) \le \gamma\left(\frac{P(T)}{N}\right), \quad \forall T \subseteq E. \tag{50}$$

By monotonicity of $\gamma$, it is always possible to find an $N$ such
that the rate vector $\mathbf{R}$ is achievable. Indeed, $\mathbf{R}$ is achievable
for any $N \in [0, N_1]$, where $N_1$ is the largest noise level for
which all inequalities in (50) hold, so that at $N = N_1$ at least
one of them turns to equality. Let $U_1$ denote the set of users for which the
corresponding inequality in (50) turns to equality with noise
level $N_1$ (in case of having more than one equality we choose
the subset of maximum cardinality), i.e.,

$$R(U_1) = \gamma\left(\frac{P(U_1)}{N_1}\right). \tag{51}$$

By plugging (51) into (50), we have

$$R(T) \le \gamma\left(\frac{P(T)}{N_1 + P(U_1)}\right), \quad \forall T \subseteq E \setminus U_1. \tag{52}$$



Now, we associate users in $U_1$ with the output of the Gaussian
channel $y_1 = x + n_1$, where $n_1$ is additive Gaussian noise
with variance $N_1$.

We can apply the same procedure to (52), i.e., we increase
the noise level in (52) above $N_1$ until one of the inequalities turns to equality. Let $N_2$ denote
the maximum noise level for which (52) still holds. Clearly,
$N_1 < N_2$. If $U_2$ denotes the set of users satisfying (52) with
equality at noise level $N_2$, then we have

$$R(U_2) = \gamma\left(\frac{P(U_2)}{N_2 + P(U_1)}\right). \tag{53}$$

By plugging (53) into (52), we obtain

$$R(T) \le \gamma\left(\frac{P(T)}{N_2 + P(U_1) + P(U_2)}\right), \quad \forall T \subseteq E \setminus (U_1 \cup U_2). \tag{54}$$

Now, we associate users in $U_2$ with the output of the Gaussian
channel $y_2 = x + n_2$, where $n_2$ is additive Gaussian noise
with variance $N_2$.

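By repeating the above procedure, we can construct a set
of channels with noise levels $N_1 < N_2 < \ldots < N_L$ and
associated sets of users $U_1, U_2, \ldots, U_L$ with $E = \bigcup_{j=1}^{L} U_j$ such
that, for every $i \in \{1, 2, \ldots, L\}$,

$$R(U_i) = \gamma\left(\frac{P(U_i)}{N_i + P\left(\bigcup_{j=1}^{i-1} U_j\right)}\right), \tag{55}$$

$$R(T) \le \gamma\left(\frac{P(T)}{N_i + P\left(\bigcup_{j=1}^{i-1} U_j\right)}\right), \quad \forall T \subseteq U_i, \tag{56}$$

$$R(T) \le \gamma\left(\frac{P(T)}{N_i + P\left(\bigcup_{j=1}^{i} U_j\right)}\right), \quad \forall T \subseteq \bigcup_{j=i+1}^{L} U_j. \tag{57}$$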

Now, assume that the transmitter with total power $P(E)$
uses $K$-level Gaussian codebooks for data broadcasting. The
transmitted signal can be written as $x = x_1 + x_2 + \ldots + x_K$,
where $x_i$ is a Gaussian codeword with power $P_i$ and rate $R_i$
and contains information for the $i$'th user. The received signal at
noise level $N_i$ can be written as $y_i = x + n_i$. The set of
inequalities in (57) implies that all users at noise level $N_i$
can decode data streams of users in $\bigcup_{j=i+1}^{L} U_j$ considering
everything else as noise. By removing the effect of users in
$\bigcup_{j=i+1}^{L} U_j$ from the received signal, the set of inequalities in
(56) implies that all users in $U_i$ can decode their own signal
considering users in $\bigcup_{j=1}^{i-1} U_j$ as noise. In other words, all
users at the same level of noise can decode their signals by
first decoding the users at upper levels, removing their
effect, and considering users at lower levels as Gaussian noise.
Hence, we obtain a Gaussian broadcast channel in which
the rate vector $\mathbf{R}$ is achievable and Gaussian codebooks are
constructed according to the power vector $\mathbf{P}$. It remains to
show that $\mathbf{R}$ is on the boundary of the capacity region. The
capacity region of the Gaussian broadcast channel is fully
characterized and there is an explicit expression for boundary
points, cf. [23]. The equalities in (55) guarantee that the rate
vector $\mathbf{R}$ lies on the boundary of the capacity region. This
completes the proof. □
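The level-by-level construction in the proof can be sketched in a few lines of code. This is a brute-force illustration for small $K$ only, under several assumptions of ours: $\gamma(x) = \frac{1}{2}\log_2(1+x)$, users are indexed from 0, all rates are strictly positive, and the maximum-cardinality tight subset is taken to be the union of all tight subsets. The names `gamma` and `broadcast_noise_levels` are hypothetical.

```python
import itertools
import math

def gamma(x):
    # Assumed definition: gamma(x) = 0.5 * log2(1 + x), in bits per channel use.
    return 0.5 * math.log2(1.0 + x)

def broadcast_noise_levels(powers, rates, tol=1e-9):
    """Return [(N_1, U_1), (N_2, U_2), ...] by exhaustive search over subsets.

    At each level, the noise is raised until some constraint
    R(T) <= gamma(P(T) / (N + P(already assigned users))) becomes tight.
    Exponential in the number of users; all rates must be strictly positive.
    """
    remaining = list(range(len(powers)))
    lower_power = 0.0  # P(U_1) + ... + P(U_{i-1}), which acts as extra noise
    levels = []
    while remaining:
        candidates = []
        for size in range(1, len(remaining) + 1):
            for subset in itertools.combinations(remaining, size):
                p = sum(powers[k] for k in subset)
                r = sum(rates[k] for k in subset)
                # Noise level at which R(T) = gamma(P(T) / (N + lower_power)).
                n_tight = p / (2.0 ** (2.0 * r) - 1.0) - lower_power
                candidates.append((n_tight, subset))
        noise = min(n for n, _ in candidates)
        tight = set()
        for n, subset in candidates:
            if n <= noise + tol * max(1.0, abs(noise)):
                tight.update(subset)  # union of all tight subsets (assumption)
        levels.append((noise, sorted(tight)))
        lower_power += sum(powers[k] for k in tight)
        remaining = [k for k in remaining if k not in tight]
    return levels

# Two users with unit power: user 0 is tight at N_1 = 1, user 1 at N_2 = 2.
P = [1.0, 1.0]
R = [gamma(1.0), gamma(1.0 / 3.0)]
print(broadcast_noise_levels(P, R))
```

In the usage example, user 1's rate equals $\gamma(P_2 / (N_2 + P_1))$ with $N_2 = 2$, which matches the form of (53).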

Theorem 5: The rate $C$, the output of Algorithm 2, is the
capacity of the channel described in (1).

Proof: We rewrite the achievable rate given in Algorithm
2 by using the Gaussian distribution as codebook generator.
As discussed earlier, the set of users can be partitioned into
three subsets $V$, $U \cup \{1\}$, and $W$.

$W$ is the subset of interfering users that the receiver can
decode considering everything else as noise. Since Gaussian
noise is the worst noise for additive channels, cf. [24] and
[25], and $W$ is decodable when other users are considered as
Gaussian noise, $W$ is decodable for any input distribution
of the intended user. As a result, interfering users in $W$ can be
completely eliminated regardless of the input codebook.

$V$ is the complement of the maximum decodable subset and
must be considered as noise. From (27), we have

$$R(T) > \gamma\left(\frac{P(T)}{N + P(V \setminus T)}\right), \quad \forall T \subseteq V. \tag{58}$$

$U$ is the solution to the minimization problem in (43).
Hence, we have

$$C + R(U) = \gamma\left(\frac{P_1 + P(U)}{N + P(V)}\right). \tag{59}$$

We apply Lemma 4 to the set of users in $V$ with associated
power vector $\mathbf{P}(V)$ and rate vector $\mathbf{R}(V)$. Let $N_1 < N_2 <
\ldots < N_L$ denote the noise levels and $U_1, U_2, \ldots, U_L$ denote
the collection of subsets of users associated with each level of
noise for the Gaussian broadcast channel with the properties
given in Lemma 4. We claim that $N_L < N$. To verify this,
we substitute $U_L$ into (55) and (58). Hence, we obtain

$$\gamma\left(\frac{P(U_L)}{N_L + P(V \setminus U_L)}\right) > \gamma\left(\frac{P(U_L)}{N + P(V \setminus U_L)}\right), \tag{60}$$

which results in $N_L < N$.

Next, we add $U_{L+1} = U \cup \{1\}$ as a set of new users to
the Gaussian broadcast channel with noise level $N_{L+1} = N$
and increase the transmitter's total power by $P_1 + P(U)$. It
is easy to verify that the conditions in (55), (56), and (57)
are still satisfied with the new broadcast channel. Consequently,
the rate vector lies on the boundary of the capacity region.
Therefore, reliable data transmission at any rate above $C$
results in reliable data transmission outside the capacity region,
which is a contradiction. This completes the proof. □
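To make the Gaussian rate in (59) concrete, the sketch below evaluates it by exhaustive search once the partition is known. It assumes, as our reading of (43) and (59), that the rate is the minimum over subsets $U$ of the jointly decoded users of $\gamma\left(\frac{P_1 + P(U)}{N + P(V)}\right) - R(U)$, with $\gamma(x) = \frac{1}{2}\log_2(1+x)$. It does not implement Algorithm 2; the names `gamma` and `gaussian_rate` are hypothetical.

```python
import itertools
import math

def gamma(x):
    # Assumed definition: gamma(x) = 0.5 * log2(1 + x).
    return 0.5 * math.log2(1.0 + x)

def gaussian_rate(p1, noise, decodable, noise_users, powers, rates):
    """Brute-force evaluation of min over U of gamma(...) - R(U).

    decodable:   interfering users decoded jointly with user 1
    noise_users: users in V, whose power is added to the noise
    powers, rates: dicts mapping a user index to its power and rate
    """
    effective_noise = noise + sum(powers[k] for k in noise_users)
    best = math.inf
    for size in range(len(decodable) + 1):
        for subset in itertools.combinations(decodable, size):
            p_u = sum(powers[k] for k in subset)
            r_u = sum(rates[k] for k in subset)
            best = min(best, gamma((p1 + p_u) / effective_noise) - r_u)
    return best

# One decodable interferer (user 2), no users treated as noise.
print(gaussian_rate(1.0, 1.0, [2], [], {2: 1.0}, {2: 0.2}))
print(gaussian_rate(1.0, 1.0, [2], [], {2: 1.0}, {2: 0.4}))
```

With a low interferer rate the single-user bound $\gamma(P_1/N)$ is active; with a higher rate the sum-rate constraint becomes the binding one.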

VI. Applications for the M-user Gaussian IC 

In this section, we apply the proposed algorithms to the
$M$-user Gaussian IC modeled by

$$y_i = \sum_{j=1}^{M} h_{ij} x_j + z_i, \tag{61}$$

where $x_j$ is the transmitted symbol of user $j$ and $h_{ij}$ denotes
the link's gain between the $j$'th transmitter and the $i$'th
receiver. $z_i$ is additive white Gaussian noise with zero mean
and variance $N_i$. User $i$ is also subject to an average power
constraint $P_i$. The capacity region of this channel is denoted
by $\mathscr{C}_{\mathrm{GIC}}$.

It is more convenient to write the system model in matrix
form as

$$\mathbf{y} = H \mathbf{x} + \mathbf{z}, \tag{62}$$

where $\mathbf{y} = [y_1, y_2, \ldots, y_M]^T$ and $\mathbf{x} = [x_1, x_2, \ldots, x_M]^T$
denote the output and input vectors, respectively. $H = [h_{ij}]$
is the matrix of links' gains, and $\mathbf{z} = [z_1, z_2, \ldots, z_M]^T$ is the
Gaussian noise vector, which has a diagonal covariance matrix.
By scaling transformations, it is possible to write the channel
model (62) in standard form, where the noise variances and
diagonal elements of $H$ are one [26].
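One common way to carry out such a scaling is sketched below. It is an assumption on our part that this matches the transformation in [26]: receiver $i$ divides its output by $\sqrt{N_i}$ and transmitter $j$'s symbol is rescaled by $h_{jj}/\sqrt{N_j}$, which requires nonzero diagonal gains. The function name `to_standard_form` is hypothetical.

```python
import math

def to_standard_form(H, noise_vars, powers):
    """Rescale a Gaussian IC so noise variances and diagonal gains equal one.

    H:          M x M list of lists of real link gains (nonzero diagonal)
    noise_vars: list of noise variances N_i
    powers:     list of power constraints P_j
    Returns (H_std, P_std), the rescaled gains and power constraints.
    """
    M = len(H)
    # x'_j = scale[j] * x_j absorbs the direct gain and the noise scaling.
    scale = [H[j][j] / math.sqrt(noise_vars[j]) for j in range(M)]
    H_std = [[H[i][j] / (math.sqrt(noise_vars[i]) * scale[j]) for j in range(M)]
             for i in range(M)]
    P_std = [scale[j] ** 2 * powers[j] for j in range(M)]
    return H_std, P_std

H_std, P_std = to_standard_form([[2.0, 1.0], [0.5, 4.0]], [4.0, 1.0], [1.0, 1.0])
print(H_std, P_std)
```

After the transformation the power constraints change along with the cross gains, so both must be carried into any later rate computation.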

Let us assume each transmitter is allowed to transmit data
by using a single Gaussian codebook and each receiver is
allowed to decode any subset of interfering users. Let $\Psi$
denote the set of decoding strategies. By a decoding strategy
$\psi = \{S_1, S_2, \ldots, S_M\} \in \Psi$, we mean that receiver $i$ tries
to decode all users' data in $S_i$. Clearly, $S_i \subseteq E$ and $i \in S_i$.
Since there are $2^{M-1}$ possible choices for each $S_i$, we have
$2^{M(M-1)}$ possible strategies in total. Hence, $|\Psi| = 2^{M(M-1)}$.
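The count of decoding strategies can be checked by direct enumeration for small $M$. The sketch below is illustrative only; the function name `decoding_strategies` is ours.

```python
from itertools import chain, combinations, product

def decoding_strategies(M):
    """Yield every strategy (S_1, ..., S_M) with S_i a subset of E containing i."""
    users = range(1, M + 1)
    per_receiver = []
    for i in users:
        others = [k for k in users if k != i]
        # Every subset of the interfering users, each extended by user i itself.
        subsets = chain.from_iterable(
            combinations(others, r) for r in range(len(others) + 1))
        per_receiver.append([frozenset(s) | {i} for s in subsets])
    # A strategy picks one decoding set independently for each receiver.
    return product(*per_receiver)

for M in (1, 2, 3):
    count = sum(1 for _ in decoding_strategies(M))
    assert count == 2 ** (M * (M - 1))
```

The exponential growth of this set is the reason exhaustive search over strategies is impractical beyond a handful of users.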

Given a strategy $\psi$, a rate vector $\mathbf{R}$ is achievable with respect
to that strategy if every receiver can reliably decode its
associated users. Therefore, an achievable rate region $\mathscr{C}_{\psi}$ can
be defined as the set of all rate vectors that are achievable
with respect to the strategy $\psi$. Let $\mathscr{C} = \bigcup_{\psi \in \Psi} \mathscr{C}_{\psi}$. Clearly,
$\mathscr{C} \subseteq \mathscr{C}_{\mathrm{GIC}}$, and it can be shown that $\mathscr{C}$ is not convex in
general.



A. Some Extreme Points of $\mathscr{C}$

Given an ordering $\pi$ of users, we aim at maximizing users'
rates in accordance with it. In general, there are $M!$ orderings
of users, which result in $M!$ not necessarily distinct achievable
rates in the capacity region. Due to the polymatroidal property
of the capacity region of the Gaussian MAC, every permutation
leads to a distinct achievable rate vector; whereas, $\mathscr{C}$ is
not a polymatroid and hence there may be some permutations
that lead to the same achievable rate vector. Without loss of
generality, we may assume the order is the same as that of
users' indices, i.e., the permutation matrix is the identity.

Setting the first user's rate to its maximum value $R_1 =
\gamma(P_1)$ imposes some constraints on the other users' rates, as
they must be decoded by the first receiver. The reason is
that $R_1$ is achievable only if the first receiver can decode all the
interfering users by considering its own signal as noise and
eliminating their effects from the received signal.

Maximization of the second user's rate is more delicate,
since its transmission should not affect the first user's data
rate. However, we have the choice of lowering other users'
rates as much as needed. Hence, we assume users in the set
$\{3, 4, \ldots, M\}$ are decoded at the first and second receivers
by considering everything else as noise and their effects are
removed. $R_2$ must be chosen such that both receivers can
decode it. The maximum decodable subset at the first receiver
is $\{1\}$ by the assumption. For the second receiver, we can find the
maximum decodable subset of interfering users, which in this
case is either $\emptyset$ or $\{1\}$. Now, we can run Algorithm 2 at both
receivers to find an achievable rate for each receiver. Clearly,
the minimum of the two achievable rates is achievable and
we set $R_2$ to this value. Besides, we obtain the strategy
$\psi^{(2)} = \{S_1^{(2)}, S_2^{(2)}\}$ in which $R_1$ and $R_2$ are achievable, where
$S_1^{(2)}, S_2^{(2)} \subseteq E^{(2)}$ and $E^{(i)} = \{1, 2, \ldots, i\}$.

To maximize the rate of user $i$, we proceed as follows.
We treat users above index $i$ as if they do not exist, i.e., we
put constraints on their rates in such a way that all the
receivers with indices in $E^{(i)}$ can decode them first and
remove their effects. From maximization of users' rates in
the previous steps, we have $\mathbf{R}_{E^{(i-1)}}$ and its corresponding
achievable strategy $\psi^{(i-1)} = \{S_1^{(i-1)}, \ldots, S_{i-1}^{(i-1)}\}$, where
$S_j^{(i-1)} \subseteq E^{(i-1)}$, $\forall j \in E^{(i-1)}$. $R_i$ must be chosen such that
all receivers in $E^{(i)}$ can decode it. The maximum decodable
subset of interfering users is given by $\psi^{(i-1)}$ for all receivers
in $E^{(i-1)}$. We can also find the maximum decodable subset
of interfering users at receiver $i$ by running the maximum
decodable subset algorithm. Let
$S_i^{(i-1)}$ denote this subset. From (43), $R_i$ is achievable at
receiver $j \in E^{(i)}$ if it is less than $R_{ij}$, which is defined as

$$R_{ij} = \min_{U \subseteq S_j^{(i-1)}} \gamma\left(\frac{h_{ji}^2 P_i + \sum_{k \in U} h_{jk}^2 P_k}{1 + \sum_{k \in E^{(i-1)} \setminus S_j^{(i-1)}} h_{jk}^2 P_k}\right) - R(U). \tag{63}$$

Hence, $R_i$ can be chosen as the minimum of all $R_{ij}$'s. For the
next step, we need a new achievable strategy. It is easy to see
that $\psi^{(i)} = \{S_1^{(i-1)} \cup \{i\}, \ldots, S_{i-1}^{(i-1)} \cup \{i\}, S_i^{(i-1)} \cup \{i\}\}$ is
the proper strategy at step $i$. Now, we can iterate until the last
user.

Algorithm 3 (successive maximization of users' rates):

1) Set $R_1 = \gamma(P_1)$ and $S_1^{(1)} = \{1\}$.

2) For $i = 2 : M$, do:

a) Find the maximum decodable subset of interfering
users in the set $E^{(i-1)}$ for receiver $i$, assuming
that users in the set $E \setminus E^{(i)}$ are decoded and
their effects are removed.

b) Solve the following optimization problem

$$R_i = \min_{j \in E^{(i)}} R_{ij}, \tag{64}$$

where $R_{ij}$ is defined in (63).

c) $S_j^{(i)} = S_j^{(i-1)} \cup \{i\}$, for all $j \in E^{(i)}$.
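The control flow of Algorithm 3 can be sketched as follows. The two subroutines are passed in as callables because their internals (the maximum decodable subset search and the bound $R_{ij}$ in (63)) are not reproduced here; the names `successive_maximization`, `max_decodable_subset`, and `rate_bound`, as well as their signatures, are hypothetical.

```python
def successive_maximization(M, first_rate, max_decodable_subset, rate_bound):
    """Skeleton of the successive rate maximization loop.

    first_rate:           R_1, e.g. gamma(P_1)
    max_decodable_subset: callable (i, rates) -> iterable of interfering
                          users decodable at receiver i (step a)
    rate_bound:           callable (i, j, S_j, rates) -> R_ij (used in step b)
    Returns (rates, strategy) as dicts keyed by user index 1..M.
    """
    rates = {1: first_rate}
    strategy = {1: {1}}
    for i in range(2, M + 1):
        # Step a: decodable interferers at the new receiver i.
        strategy[i] = set(max_decodable_subset(i, rates))
        # Step b: user i must be decodable at every receiver j <= i.
        rates[i] = min(rate_bound(i, j, strategy[j], rates)
                       for j in range(1, i + 1))
        # Step c: every receiver j <= i now also decodes user i.
        for j in range(1, i + 1):
            strategy[j] = strategy[j] | {i}
    return rates, strategy

# Toy stand-ins, only to exercise the control flow.
rates, strategy = successive_maximization(
    3, 0.5,
    max_decodable_subset=lambda i, rates: set(),
    rate_bound=lambda i, j, S_j, rates: 1.0 / (i + j))
print(rates, strategy)
```

The toy callables carry no information-theoretic meaning; in an actual implementation they would be replaced by the procedures described in the text.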

For the sake of completeness, in the following theorem, we
state that the above algorithm finishes in polynomial time.

Theorem 6: Algorithm 3 converges to an extreme point of
$\mathscr{C}$ in polynomial time.

Proof: At the $i$'th iteration, we need to solve $i$ submodular
function minimizations. Hence, in total, a submodular
function minimization subroutine is invoked at most $M(M + 1)/2$
times. Moreover, at each step, we need to find the maximum
decodable subset, which can be accomplished in polynomial
time as shown earlier. Hence, Algorithm 3 converges to
an extreme point of $\mathscr{C}$ in polynomial time. □

It is worth noting that for the two-user Gaussian IC in
the case of strong and very strong interference [27], the
output of Algorithm 3 is a point on the boundary of the
capacity region. In the case of weak interference, however,
the output of Algorithm 3 coincides with Costa's result in
[28]. Unfortunately, the optimality of the result claimed by
Costa has not been proved yet [27]. As a result, proving the
optimality of extreme points obtained from Algorithm 3 has at
least the same level of difficulty as that of the two-user case.

B. Generalized One-sided Gaussian IC 

Parallel to the definition of the one-sided Gaussian IC [28],
we define the generalized one-sided Gaussian IC as one in
which the channel matrix $H$ can be represented as a triangular
matrix by row permutations. For the sake of simplicity, we
always assume that $H$ is lower triangular. Hence, the first user
incurs no interference from other users, i.e., $y_1 = x_1 + z_1$, the
second user incurs interference only from the first user, i.e.,
$y_2 = h_{21} x_1 + x_2 + z_2$, and in general, user $i$ incurs interference
from preceding users, i.e., $y_i = h_{i1} x_1 + \ldots + h_{i(i-1)} x_{i-1} + x_i + z_i$.

From optimality of the proposed achievable rate region for 
the Gaussian case, we can prove the following theorem. 

Theorem 7: For the generalized one-sided Gaussian IC, 
successive maximization of users' rates gives an achievable 
rate vector which lies on the boundary of the capacity region. 

Proof: We can prove this theorem by induction on the
number of users. For a single user, it is trivial that maximizing
the user's rate results in transmission at the capacity of the channel.
Now, we assume that for $m - 1$ users, successive maximization
of users' rates gives us a point $\mathbf{R}_{E^{(m-1)}}$ on the boundary of the capacity region.
Due to Theorem 5, user $m$, which does not interfere with other
users and only receives interference from other users, transmits
at a rate $R_m$ which is the capacity of its channel assuming
other users use Gaussian codebooks for data transmission.
Since no further increase of any rate is possible, the rate
vector $\mathbf{R} = [\mathbf{R}_{E^{(m-1)}}, R_m]$ lies on the boundary of the
capacity region of the $m$-user Gaussian IC. □

The capacity region of strong and very strong two-user
Gaussian ICs is known and corresponds to the capacity of the
corresponding compound MAC where both receivers decode
both users' messages [2], [3]. Therefore, for the $M$-user case,
it is interesting to find similar situations where the capacity
is achievable when all receivers decode all messages sent by
all transmitters. However, by a counterexample, it is easy
to show that having the condition $h_{ij}^2 \ge 1$, $\forall i, j \in E$, is not
sufficient to establish similar results. To find similar situations,
we define the strong generalized one-sided Gaussian IC as the
channel with triangular channel matrix $H$ with the property
that $h_{ik}^2 \ge h_{jk}^2$ whenever $i > j$. In the following theorem,
we prove that the capacity region of the strong generalized
one-sided Gaussian IC can be fully characterized.

Theorem 8: The capacity region of the strong generalized
one-sided Gaussian IC is $\bigcap_{i \in E} C_{\mathrm{MAC}}(i)$, where $C_{\mathrm{MAC}}(i)$ denotes the
capacity region of the MAC seen at the $i$'th receiver.

Proof: This theorem can also be proved by induction
on the number of users. For a single user, it is trivial. We
assume that for a channel with $m - 1$ users and a triangular
channel matrix, the capacity region is $\bigcap_{i \in E \setminus \{m\}} C_{\mathrm{MAC}}(i)$.
Now, we add a new user which does not interfere with other
users and only receives interference from all other users. Let
$C_{\mathrm{GIC}}(m)$ denote the capacity region of the $m$-user Gaussian IC. It
suffices to show that for any rate vector $\mathbf{R} = [\mathbf{R}_{-m}, R_m]$ in
$C_{\mathrm{GIC}}(m)$, receiver $m$ is able to decode all users' messages.
The idea that we use here is similar to the idea of Han and
Kobayashi for proving the capacity region of strong and very
strong two-user Gaussian ICs [4]. Since $\mathbf{R}_{-m}$ is achievable
and there is no interference from user $m$, we have $\mathbf{R}_{-m} \in
\bigcap_{i \in E \setminus \{m\}} C_{\mathrm{MAC}}(i)$. In particular, $\mathbf{R}_{-m} \in C_{\mathrm{MAC}}(m - 1)$.
Hence, receiver $m - 1$, which has $y_{m-1} = h_{(m-1)1} x_1 + \ldots +
h_{(m-1)(m-2)} x_{m-2} + x_{m-1} + z_{m-1}$ as the received signal,
can jointly decode all users in the set $E \setminus \{m\}$. Since $R_m$ is
decodable by the $m$'th receiver, it can be removed from the
received signal $y_m = h_{m1} x_1 + \ldots + h_{m(m-1)} x_{m-1} + x_m + z_m$.
Now, receiver $m$ can try to decode other users' data from
$\tilde{y}_m = h_{m1} x_1 + \ldots + h_{m(m-1)} x_{m-1} + z_m$. Let $\tilde{C}_{\mathrm{MAC}}(m - 1)$
denote the capacity region of this MAC. By hypothesis, $h_{ik}^2 \ge h_{jk}^2$
whenever $i > j$. Therefore, $C_{\mathrm{MAC}}(m - 1) \subseteq \tilde{C}_{\mathrm{MAC}}(m - 1)$.
Hence, receiver $m$ is able to decode the rate vector $\mathbf{R}_{-m}$. This
completes the proof. □
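Membership in the region of Theorem 8 can be tested by brute force for small $M$. The sketch below assumes the standard form (unit noise variance, unit diagonal gains), $\gamma(x) = \frac{1}{2}\log_2(1+x)$, and that the MAC seen at receiver $i$ involves exactly the users with nonzero gain to that receiver; the names `gamma` and `in_mac_intersection` are hypothetical, and users are indexed from 0.

```python
import itertools
import math

def gamma(x):
    # Assumed definition: gamma(x) = 0.5 * log2(1 + x).
    return 0.5 * math.log2(1.0 + x)

def in_mac_intersection(H, powers, rates, tol=1e-12):
    """Check whether `rates` lies in the intersection of the MAC regions.

    H: M x M lower-triangular gain matrix in standard form (unit noise).
    For each receiver i and each nonempty subset T of the users it sees,
    the sum rate R(T) must not exceed gamma(sum over T of h_ik^2 * P_k).
    """
    M = len(H)
    for i in range(M):
        seen = [k for k in range(M) if H[i][k] != 0]
        for size in range(1, len(seen) + 1):
            for T in itertools.combinations(seen, size):
                sum_rate = sum(rates[k] for k in T)
                received_power = sum(H[i][k] ** 2 * powers[k] for k in T)
                if sum_rate > gamma(received_power) + tol:
                    return False
    return True

H = [[1.0, 0.0], [2.0, 1.0]]   # h_21^2 = 4 >= h_11^2 = 1
print(in_mac_intersection(H, [1.0, 1.0], [0.5, 0.5]))
print(in_mac_intersection(H, [1.0, 1.0], [0.6, 0.1]))
```

In the usage example, the second rate vector fails because user 1's rate exceeds $\gamma(P_1)$ at its own receiver.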

VII. Conclusion 

We investigated data transmission over a channel with 
M — 1 interfering users. By establishing certain properties 
of the maximum decodable subset, we proposed a polynomial-time
algorithm that separates the interfering users into two
disjoint parts: the users whose messages the receiver can jointly
decode, and the remaining users. We introduced
an optimization problem that gives an achievable rate for this
channel. We proposed a polynomial-time algorithm for solving
this optimization problem. We also established the capacity of
the additive Gaussian channel with Gaussian interfering users
and showed that the Gaussian distribution is optimal and the 
proposed achievable rate is the capacity of this channel. 

As an application of this method, we investigated data
transmission for the case of the $M$-user interference channel when
transmitters use single codebooks for data transmission, and
receivers are allowed to decode other users' messages. We then
introduced an achievable rate region $\mathscr{C}$. We obtained some
extreme points of $\mathscr{C}$ by successive maximization of users'
rates. We also characterized one point on the boundary of the capacity region of the
generalized one-sided Gaussian IC. Finally, we obtained the
capacity region of the strong generalized one-sided Gaussian
IC.